A 2 2/3 Superstring Approximation Algorithm
Authors
Abstract
Given a collection of strings $S = \{s_1, \ldots, s_n\}$ over an alphabet $\Sigma$, a superstring of $S$ is a string containing each $s_i$ as a substring; that is, for each $i$, $1 \le i \le n$, the superstring contains a block of $|s_i|$ consecutive characters that match $s_i$ exactly. The shortest superstring problem is the problem of finding a superstring of minimum length. It has applications in both data compression and computational biology. Blum et al. [3] showed it to be MAX SNP-hard. The first $O(1)$-approximation algorithm also appeared in [3]; it returns a superstring no more than 3 times the length of an optimal solution. Prior to the algorithm described in this paper, several published results improved on that approximation ratio; of these, the best was our algorithm ShortString, a $2\frac{3}{4}$-approximation [1]. We present our new algorithm, G-ShortString, which achieves an approximation ratio of $2\frac{2}{3}$. Our approach builds on the work in [1], in which we identified classes of strings that have a nested periodic structure and that must be present in the worst case for our algorithms. We introduced machinery to describe these strings and proved strong structural properties about them. In this paper we extend this study to strings that exhibit a more relaxed form of the same structure, and we use this understanding to obtain our improved result.
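To make the definitions in the abstract concrete, here is a minimal Python sketch. It is not G-ShortString or ShortString, whose details the abstract does not give. It shows a superstring check, the classic greedy merge heuristic (repeatedly merge the two strings with the largest overlap), and a brute-force optimum for tiny inputs so the two lengths can be compared. All function names are hypothetical and chosen for illustration only.

```python
from itertools import permutations


def is_superstring(candidate, strings):
    """True if every string occurs in candidate as a contiguous block."""
    return all(s in candidate for s in strings)


def overlap(a, b):
    """Length of the longest proper-or-full suffix of a that is a prefix of b."""
    for k in range(min(len(a), len(b)), 0, -1):
        if a.endswith(b[:k]):
            return k
    return 0


def substring_free(strings):
    """Drop duplicates and strings already contained in another string."""
    items = list(dict.fromkeys(strings))
    return [s for s in items if not any(s != t and s in t for t in items)]


def greedy_superstring(strings):
    """Greedy heuristic (assumption: illustration only, not the paper's method)."""
    items = substring_free(strings)
    while len(items) > 1:
        best = (-1, 0, 1)  # (overlap length, index of left, index of right)
        for i, a in enumerate(items):
            for j, b in enumerate(items):
                if i != j:
                    k = overlap(a, b)
                    if k > best[0]:
                        best = (k, i, j)
        k, i, j = best
        merged = items[i] + items[j][k:]
        items = [s for idx, s in enumerate(items) if idx not in (i, j)]
        items.append(merged)
    return items[0] if items else ""


def brute_force_shortest(strings):
    """Exact optimum by trying every ordering; only feasible for tiny inputs."""
    items = substring_free(strings)
    best = None
    for order in permutations(items):
        current = order[0] if order else ""
        for nxt in order[1:]:
            current += nxt[overlap(current, nxt):]
        if best is None or len(current) < len(best):
            best = current
    return best if best is not None else ""


if __name__ == "__main__":
    S = ["abc", "bcd", "cde"]
    g = greedy_superstring(S)
    opt = brute_force_shortest(S)
    print(g, len(g), len(opt), len(g) / len(opt))
```

An approximation ratio of $2\frac{2}{3}$ means the returned superstring is guaranteed to be at most $2\frac{2}{3}$ times the length of an optimal one on every input; the last printed value is the analogous ratio for the greedy heuristic on this single example.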
Related resources
Parallel and Sequential Approximations of Shortest Superstrings
Superstrings have many applications in data compression and genetics. However the decision version of the shortest superstring problem is NP-complete. In this paper we examine the complexity of approximating a shortest superstring. There are two basic measures of the approximations: the compression ratio and the approximation ratio. The well known and practical approximation algorithm is the se...
Approximating the Shortest Superstring Problem Using de Bruijn Graphs
The best known approximation ratio for the shortest superstring problem is $2\frac{11}{23}$ (Mucha, 2012). In this note, we improve this bound for the case when the length of all input strings is equal to $r$, for $r \le 7$. For example, for strings of length 3 we get a $1\frac{1}{3}$-approximation. An advantage of the algorithm is that it is extremely simple both to implement and to analyze. Another advantage is tha...
Rotations of Periodic Strings and Short Superstrings
This paper presents two simple approximation algorithms for the shortest superstring problem, with approximation ratios $2\frac{2}{3}$ ($\approx 2.67$) and $2\frac{25}{42}$ ($\approx 2.596$), improving the best previously published $2\frac{3}{4}$ approximation. The framework of our improved algorithms is similar to that of previous algorithms in the sense that they construct a superstring by computing some optimal cycle covers on the distance gr...
Better Approximation Algorithms for Maximum Asymmetric Traveling Salesman and Shortest Superstring
In the maximum asymmetric traveling salesman problem (Max ATSP) we are given a complete directed graph with nonnegative weights on the edges and we wish to compute a traveling salesman tour of maximum weight. In this paper we give a fast combinatorial $\frac{3}{4}$-approximation algorithm for Max ATSP. It is based on a novel use of half-edges, matchings and a new method of edge coloring. (A half-edge of...
A 2 2/3-Approximation Algorithm for the Shortest
Given a collection of strings $S = \{s_1, \ldots, s_n\}$ over an alphabet $\Sigma$, a superstring of $S$ is a string containing each $s_i$ as a substring; that is, for each $i$, $1 \le i \le n$, the superstring contains a block of $|s_i|$ consecutive characters that match $s_i$ exactly. The shortest superstring problem is the problem of finding a superstring of minimum length. The shortest superstring problem has applications in both data co...
Journal: Discrete Applied Mathematics
Volume: 88
Publication date: 1998